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We have targeted the activation of an endogenous 
chromosomal locus including the human erythropoietin 
gene using synthetic transcription factors. These tran- 
scription factors are targeted to particular DNA se- 
quences in the 5 '-flanking region of the erythropoietin 
gene through engineering of a zinc finger DNA binding 
domain. The DNA binding domain is linked to a VP16 
transcriptional activation domain. We find that these 
synthetic transcription factors invariably activate tran- 
siently transfected templates in which sequences within 
the 5' flank of the erythropoietin gene are fused to a 
luciferase reporter. The efficiency of activation under 
these circumstances at a defined site is dependent on 
DNA binding affinity. In contrast, only a subset of these 
same zinc finger proteins is able to activate the endog- 
enous chromosomal locus. The activity of these proteins 
is influenced by their capacity to gain access to their 
recognition elements within the chromatin infrastruc- 
ture. Zinc finger transcription factors will provide a 
powerful tool to probe the determinants of chromatin 
accessibility and remodeling within endogenous chro- 
mosomal loci. 



The enormous progress in our understanding of gene control 
in eukaryotes using model systems presents substantial oppor- 
tunities to apply this knowledge for therapeutic benefit in man. 
The rational design and engineering of components of the tran- 
scriptional machinery provide a powerful means to test conven- 
tional paradigms for the roles of protein-DNA and protein- 
protein interactions in gene regulation. These designer 
transcription factors may also provide novel means of regulat- 
ing endogenous chromosomal loci for a variety of beneficial 
purposes. Over the past decade, the primary structural deter- 
minants of DNA recognition by zinc fingers of the Cys 2 -His 2 
type have been elucidated (1-8). Designer transcriptional reg- 
ulators containing three or more zinc finger domains have been 
used in isolation (9, 10) or following linkage to transcriptional 
activation (9, 11-13) or repression domains (12, 13). These 
novel proteins control the transcription of reporter genes both 
transiently transfected into human cells (9, 11-13) and endog- 
enous chromosomal loci (9, 13). Exactly how these regulatory 
functions are exerted remains to be resolved. An important 
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issue in considering transcription factor function in eukaryotes 
is the capacity of the regulator to gain access to specific sites in 
chromatin and recruit transcriptional co-activators and co-re- 
pressors that modify the chromatin environment (14). These 
issues have been investigated for the archetypal Cys 2 -His 2 zinc 
finger protein and transcriptional regulator TFIIIA 1 (15-26). 
There is general agreement that the nucleosome can impede 
recognition of specific promoter elements by TFIIIA (15, 17-26) 
and that modification of histone-DNA interactions through nu- 
cleosome repositioning (18, 19), histone depletion (20, 21), and 
removal of the histone tails (22, 23) can promote TFIIIA bind- 
ing to a nucleosomal infrastructure. Accumulation of histone 
HI in chromatin can specifically interfere with TFIIIA function 
in vivo (24, 25) in a process that involves the repositioning of 
nucleosomes (25, 26). These studies demonstrate the role of 
chromatin infrastructure access by transcription regulators 
containing zinc finger DNA binding domains of the Cys 2 -His 2 
type. 

In this work, we first designed 10 novel zinc finger DNA 
binding domains to recognize specific 9-bp sequences in the 5' 
flank of the erythropoietin gene and characterized their unique 
DNA recognition selectivities and variable affinities for DNA. 
These zinc finger domains were then linked to the VP16 tran- 
scriptional activation domain (27) and tested for their capacity 
to activate transcription in both transient transfections and 
from endogenous chromosomal loci. Our results indicate that 
all of our synthetic regulators that bind DNA in vitro with 
dissociation constants <10 nM can activate transiently trans- 
fected templates. This activation of a particular site is depend- 
ent on DNA binding affinity. In contrast, only a subset of the 
synthetic regulators can activate the endogenous chromosomal 
locus. 

We find that the regulators that work in the endogenous 
chromosome can bind within the chromatin infrastructure, but 
that the differential binding of distinct regulators at a partic- 
ular site is largely independent of primary DNA binding affin- 
ity. These studies indicate that chromosome and chromatin 
organization is a determinant of zinc finger transcription factor 
function within endogenous chromosomal loci. 

EXPERIMENTAL PROCEDURES 

Design, Synthesis, and Purification of Zinc Finger Proteins — The 
foundation for our design strategy was to scan 1000 bp of the 5'- 



1 The abbreviations used are: TFIIIA, transcription factor IIIA; bp, 
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EPOZFP, erythropoietin-directed zinc finger proteins. 
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Fig. 1. The construction scheme of 
EPOZFPs. A, the structure of an individ- 
ual zinc finger with two 0-sheets linked to 
the DNA-binding a- helix. Oligos 1, 3, and 
5 comprise the p-sheet regions, and oligos 
2, 4, and 6 comprise the DNA-binding 
a-helix regions. B y the assembly scheme 
of the ZFPs. Six overlapping oligonucleo- 
tides were annealed and amplified with a 
pair of external oligonucleotides. The 
PCR products were then cut with Kpnl 
and BamHL before cloning into the 
pMalC2 bacterial expression vector. C, 
scheme of the maltose-binding EPOZFP 
fusions. 
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flanking sequences of the human erythropoietin (EPO) gene to choose 
the best candidate recognition elements for which to design zinc finger 
DNA binding domains. The human transcription factor Spl (amino 
acids 532-624) was used as the backbone to assemble 10 distinct zinc 
finger DNA binding domains. The design determinants of these pro- 
teins will be described elsewhere 2 ; however, the amino acid sequences 
chosen to recognize particular sequences are illustrated in Table I. Our 
strategy to synthesize the zinc finger DNA binding domains is outlined 
in Fig. 1. To create the synthetic genes encoding EPO-directed zinc 
finger proteins (EPOZFPs), a polymerase chain reaction (PCR)-based 
assembly procedure was applied using six overlapping oligonucleotides. 
Three oligonucleotides for the zinc finger coding sequences encode por- 
tions of the DNA binding domain containing the 0 sheet and linker 
regions between the a -helical DNA recognition sequences of the Spl 
zinc finger DNA binding domain scaffold (Fig. L4, oligos 1, 3, and 5). 
The other three oligonucleotides encode the recognition helices (oligos 2, 
4, and 6). The overlap between adjacent oligonucleotides is 15 base 
pairs. The PCR synthesis was carried out in two steps (Fig. IB). First, 
the double- stranded DNA template was created by combining the six 
oligos and filling the gaps with a four-cycle PCR reaction (using Taq and 
Pfu thermostable DNA polymerases). The annealing temperature was 
25 "C, a temperature at which the six oligos would anneal to form a 
DNA scaffold. In the second phase of construction, the template was 
amplified using a pair or external primers containing Kpnl and Bam HI 
restriction sites. The PCR products were directly cloned into the Tac 
promoter vector, pMal-c2 (New England Biolabs, Beverly, MA), using 
Kpnl and BamHI restriction sites. The zinc finger proteins were puri- 
fied as fusions with the maltose-binding protein (Fig. 1C) according to 
the manufacturer's instructions (New England Biolabs, Beverly, MA). 
The purified ZFPs (see Fig. 2A) were tested for their affinities for the 
DNA recognition sites within the 5' flank of the EPO gene. DNA 
oligonucleotides 30 base pairs in length that contain the various sites 
were synthesized, annealed, and end-labeled using polynucleotide ki- 
nase and [y- 32 P] ATP. Binding of the ZFPs to target oligonucleotides was 
performed by titrating protein against a fixed amount of duplex sub- 
strate. Twenty-/xl binding reactions contained 50 pM 5'- ;,2 P-labeled dou- 
ble-stranded target DNA, 10 mM Tris-HCl (pH 7.5), 100 mM KC1, 1 mM 
MgCl 2 , 1 mM dithiothreitol, 10% glycerol, 200 /ig/ml bovine serum 
albumin, 0.02% Nonidet P-40, and 100 um ZnCl 2 . Binding was allowed 
to proceed for 45 min at room temperature. Polyacryl amide gel electro- 
phoresis was carried out at room temperature using precast 10% or 
10-20% Tris-HCl gels (Bio-Rad) and Tris-glycine running buffer (25 mM 
Tris-HCl, 192 mM glycine (pH 8.3)). The radioactive signals were quan- 
titated with a Fhosphorlmager and autoradiographed. Once the DNA 
binding properties of the zinc finger DNA binding domains had been 
tested, these domains were subcloned into eukaryotic expression vec- 
tors. The vector used was generated based upon a ZFP-less expression 
vector pcDNA-NVF, which was modified from pcDNA3.1 (Invitrogen). 
pcDNA-NVF contains a CMV promoter driving expression of the coding 
sequence encoding a nuclear localization signal (Pro-Lys-Lys-Lys-Arg- 
Lys-Val) from SV40 large T antigen, a herpes simplex virus VP16 
activation domain (amino acids 413-490), and a Flag peptide (Asp-Tyr- 
Lys-Asp-Asp-Asp-Asp-Lys). All of the ZFP expression vectors were con- 
structed by subcloning of the ZFP fragments into the Kpnl and BamHI 
sites in pcDNA-NVF (see Fig. 5A). pEPOZFP-862c-NF plasmid is 
similar to EPOZFP -862c, except that the VP16 transactivation domain 
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esigns of the EPO-binding ZFPs 




Name 




Target sequence 
5' —3' 


Subsites 


Designs 




5' -» 3' 


-1 +6 








F1GCTC 


QSSDLQR 


CitryJLir r— 


QCQn 

'ob/,0. 


GCGGTGGCTc 


F2GTGg 
F1GCTC 


RSDALSR 
RSDERKR 
QSSDLQR 


EPOZFP- 


-862b 


GCGGTGGCTc 


F2GTGg 
F3GCGg 
FIGCTc 


RSDALSR 
RSDTLKK 
QSSDLTR 


EPOZFP- 


862c 


GCGGTGGCTc 


F2GTGg 
F3GCGg 
FlGAGt 


RSDALSR 
RSDERKR 
RSDNLAR 


EPOZFP- 


545a 


GGTGAGGAGt 


F2GAGg 
F3GGTg 
FlGAGt 


RSDNLAR 
DSSKLSR 
RSDNLAR 


EPOZFP- 


545b 


GGTGAGGAGt 


F2GAGg 
F3GGTg 
FIGGGa 


RSDNLAR 
MSDHLSR ~ 
RSDHLAR 


EPOZFP- 


535 


GAGGTGGGGa 


F2GTGg 
F3GAGg 
FlGCGg 


RSDALSR 
RSDNLSR 
RADTLRR 


EPOZFP- 


233 


TGGGTCGCGg 


F2GTCg 
F3TGGg 
FIGAGa 


DRSALAR 
RSDHLTT 
RSDNLAR 


EPOZFP- 


82 


GTGGGGGAGa 


F2GGGg 
F3GTGg 
FIGGGg 


RSDHLSR 
RSDALAR 
RSDHLAR 


EPOZFP- 


-72 


GCGGGTGGGg 


F2GGTg 
F3GCGg 
FIGGGt 


QSSHLAR 
RSDDLTR 
KTSHLRA 


Spl 




GGGGCGGGGt 


F2GCGg 
F3GGGg 
FIGCGt 


RSDELQR 
RSDHLSK 
RSDELTR 


Zif268 




GCGTGGGCGt 


F2TGGg 
F3GCGt 


RSDHLTT 
RSDERKR 



was removed. The pBS579 construct, which was used as a negative 
control, encodes a nonspecific ZFP gene. 

Cell Culture: Stable Inducible Cell Lines Expressing ZFPs, Lucifer ase 
Reporter Assays, Northern Blots, Taqman Analysis, ELISA, and West- 
ern Blots — Human embryonic kidney cells (HEK293) were grown in 
Dulbecco's modified Eagle's medium supplemented with 10% fetal calf 
serum. To generate stable Tet-inducible EPOZFP cell lines, the coding 
region from the pEPOZFP-862 cDNA was subcloned into pcDNA4TTO 
(Invitrogen) using Afllll and Hindi II restriction sites. The resulting 
pTO-EPOZFP-862 construct was transfected into the T-Rex-293™ 
(Invitrogen) cell line using LipofectAMINE (Life Technologies, Inc.). 
After 2 weeks of selection in medium containing Zeocin™ (Invitrogen), 
stable clones were isolated and analyzed for doxycycline (Dox)-depend- 
ent activation of ZFP expression. 

The luciferase reporter constructs were generated by annealing two 
complementary oligonucleotides in an annealing buffer containing 50 
mM NaCl, 10 mM Tris-HCl, 10 mM MgCl^, 1 mM dithiothreitol. The oligo 
mixture was heated at 95 °C for 5 min and allowed to cool down slowly 
to room temperature. The annealed oligonucleotides containing three 
tandem repeats of the ZFP target sequences were inserted into the 
pGL3 promoter vector (Promega) between the Mlul and Bglll sites 
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Fig. 2. The properties of EPOZFPs. 

A, schematic representation of the human 
EPO gene, showing key structural fea- 
tures. Horizontal arrow, start of tran- 
scription; open rectangle, coding region; 
hatched boxes, Alu elements; black boxes, 
targeted sequences, sites 1, 2, 3, and 4; 
wavy lines, CpG island. Abbreviations are 
as follows: Ba, BamHl; X, Xbal; Bg, Bglll; 
Pstl; HRE, hypoxic response element. 
Numbering is relative to the start site of 
transcription (+1). The targeted sites and 
the positions of the first nucleotide in the 
sequences is shown at the bottom. B, the 
protein gel shows all the purified 
EPOZFP, Spl, and Zif268 proteins. The 
proteins were expressed as maltose-bind- 
ing fusion proteins and purified as de- 
scribed (see "Experimental Procedures*"). 
The leftmost lane contains size markers. 
C, gel -shift assays using the various 
EPOZFPs binding to their target sites. 
The name of each EPOZFP is indicated by 
the number in the top left-hand corner of 
each gel panel. Proteins are used to bind 
to their targets in 2-fold serial dilutions, 
with the highest protein concentration in 
lane 2, and the lowest concentration in 
lane 16 from left to right. Lane 1 is a 
control lane containing radiolabeled DNA 
alone. The equilibrium dissociation con- 
stants of each EPOZFP are indicated at 
the top of each panel. 
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upstream of the SV40 promoter. All constructs were confirmed by DNA 
sequencing. 

Transient transfection was carried out using LipofectAMINE. Lucif- 
erase reporter assays were performed by co-transfection of ZFP effector 
DNA (50 ng), luciferase reporter DNA (900 ng), and pCMV-0-gal (100 
ng, used as an internal control) into HEK293 cells seeded in six- well 
plates. Cell lysates were harvested 40 h after transfection, and the 
luciferase activities were measured by the Dual -Light luciferase and 
0-galactosidase reporter assay system (Tropix). To assay the activation 
of the endogenous chromosomal EPO gene, we made use of established 
procedures to carry out Northern analysis of EPO mRNA. Briefly, 
poly(A) + RNA was isolated from either mock-transfected or pcV- 
EPOZFP-862-transfected HEK293 cells using the Oligotex kit (Qia- 
gen, Valencia, CA). 7 were resolved on a 2.4% agarose gel containing 



2.4 M formaldehyde and blotted onto Nytran SuPerCharge membrane 
using 20 X SSC. The membrane was hybridized at 65 °C for 1 h in 
Rapid-Hyb Buffer (Amersham Pharmacia Biotech) containing 32 P-la- 
beled EPO cDNA probe. The same membrane was re-hybridized with a 
32 P-labeled GAPDH DNA probe after stripping the EPO probe. The 
EPO cDNA construct, pcEPO was generated by inserting a human EPO 
cDNA fragment obtained by PCR amplification into the pcDNA3.1 
vector (Invitrogen) at the Xbal and 2?coRI sites. The clone was con- 
firmed by sequencing. The pTBAHVP16 plasmid, which was used to 
generate riboprobes for detection of both human )3-actin and ZFP genes, 
was generated by inserting the VP16 fragment from the pcDNANVF 
vector into the pTRI-/3-actin-125-human vector (Ambion, Austin, TX). 
The pcDNA-NVF DNA was digested with Xhol, repaired with Klenow 
enzyme, and digested again with BamHl. The TRI-0-actin vector was 



Transcriptional Activation of Chromatin 



33853 



digested with Smal and BamHI. 

For Taqman analysis of mRNA abundance, total cellular RNA from 
transfected HEK293 cells was isolated using the RNeasy kit (Qiagen, 
Valencia, CA). Real time PCR analysis (Taqman) was performed in a 
96-well format on an ABI 7700 SDS machine (Perkin Elmer) and 
analyzed with SDS version 1.6.3 software. RNA samples (25 ng) were 
mixed with 0.3 fXM. each primer, 0.1 jum probe, 5.5 mM MgCl 2 and 0.3 mM 
each dNTP, 0.625 unit of AmpliTaq Gold RNA polymerase, 6.25 units of 
Multiscribe reverse transcriptase, and 5 units of RNase inhibitor in 
Taqman buffer A from Perkin Elmer. The reverse transcription was 
performed at 48 °C for 30 min. After denaturing at 95 °C for 10 min, 
PCR amplification reactions were conducted for 40 cycles at 95 °C for 
15 s and at 60 °C for 1 min. The EPO primer and probe set (GACTGT- 
GTGCTCTGTGCACT, CTCTCAAAGTGCTGGGATTGCA, FAM-TGA- 
GCCACCGCACCCAGCCCCCA-TAMRA) and the VP16 primer and 
probe set (CATGACGATTTCGATCTGGA, CTACTTGTCATCGTCGTC- 
CTTG, FAM-ATCGGTAAACATCTGCTCAAACTCGA-TAMRA) were 
used to measure the human EPO and ZFP expression levels, respect- 
ively. The GAPDH primer and probe set (CCTTTTGCAGACCACAGT- 
CCA, GCAGGGATGATGTTCTGGAGA, FAM-CACTGCCACCCAGAA- 
GACTGTGG-TAMRA) were used to monitor the internal control 
GAPDH mRNA. The abundance of expressed ZFPs was controlled for at 
both the RNA level as described above, but also by Western blotting. 
This analysis was performed by resolving 10 fig of whole cell lysates on 
a 10-20% Tris-HCl polyacryl amide gel containing SDS. Proteins were 
transferred onto a nitrocellulose membrane using 1 x SDS, 20% Meth- 
anol, and then the filter was blocked using 5% nonfat dry milk for 1 h 
at room temperature. Blotting was done with anti-Flag M2 monoclonal 
antibody (Sigma) diluted 1:1000 in 5% (w/v) nonfat dry milk, 0.1% 
PBS-Tween) for 1 h at room temperature. Subsequently, an anti -mouse 
horseradish peroxidase conjugate (Amersham Pharmacia Biotech) was 
used at a 1:3000 dilution in 5% (w/v) nonfat dry milk, 0.1% PBS-Tween 
for 1 h at room temperature. All washes were done in 0.1% PBS-Tween. 
The protein bands were detected by the ECL system (Amersham Phar- 
macia Biotech). 

Endogenous EPO expression was assayed either in response to tran- 
sient transfection with ZFP effectors or in response to the induction of 
pEPOZFP-862a following stable transformation. Assays were per- 
formed at the indicated time after transfection. Either RNA was ex- 
tracted as indicated above, or the culture medium was harvested for 
measurement of secreted EPO protein using the human EPO ELISA kit 
(R&D Systems, Minneapolis, MN). 

Chromatin Analysis, Chromatin Immunoprecipitation, DNase I, and 
Micrococcal Nuclease Mapping — Chromatin immunoprecipitation was 
performed using a ChIP assay kit according to the instructions from the 
manufacturer (Upstate Biotechnology, Inc., Lake Placid, NY). Approx- 
imately 2 million cells were cross-linked with 1% formaldehyde for 10 
min, washed with PBS, and lysed in lysis buffer. The cell lysate was 
sonicated on ice, resulting in a DNA fragment length of approximately 
500 bp. After removing cell debris by centrifugation, immunoprecipita- 
tion was performed in ChIP dilution buffer overnight with 3 p.g of VP16 
1-21 antibody (Santa Cruz) or 10 fig of anti acetylated H3 antibody 
(Upstate Biotechnology). The antibodies were collected with Protein 
A-agarose and washed. The cross-linking was reversed by incubation at 
65 °C for 4 h in the presence of 200 mM NaCl. The DNA was recovered 
by phenol/chloroform extraction and the abundance of particular se- 
quences quantitated using real time PCR with the ABI 7700 sequence 
detector from Perkin Elmer/Applied Biosystems as described above. 
The standard Taqman reagents and the universal thermal cycling pa- 
rameters were used (10 min at 95 *C, followed by 40 cycles of 15 s at 
95 °C and 1 min at 60 °C). The primers had the following sequences: 
-1838F, TGGTACATCTGGTGCATTGTTG; 1838R, AAATAATAGA- 
CACACACAAGATAGTGAAAGC; -927F, ACACCACAGGTCAAATA- 
AACAGATG; -927R, ACTTTTAGTGCACAGAGCACACAGT; -363F, 
GGCTTCCAGACCCAGCTACATT; -363R, GGTCTTGGGCGGAGAC- 
TCA; +538F, GTGCCAGTGGAGAGGAAGCT; +538R, CAAACTTCA- 
ATCCTGGTGTGACA; +6839F, TGGGAGTACAGGCGTGAGC; 
+6839R, GGGAAAATGATGAAAGAGAAATCAA; hGAPDH-F, GACA- 
TCAAGAAGGTGGTGAAG; hGAPDH-R, AGCTTGACAAAGTGGTCG- 
TTG. The Taqman probes were labeled with FAM at the 5' end and with 
TAMRA at the 3'end. They had the following sequences: -1834P, AAG- 
GCGGTGACCCCCCTGGAC; -927P, CATTGTGCAGGACACACATG- 
CACCTTG; -363P, CGGAACTCAGCAACCCAGGCATCT; +538P, 
TGGGCGCTGGAGCCACCACTTA; +6839P ACCGCGCCAGCCCGTG- 
TC; hGAPDH-P, CACTGAGCACCAGGTGGTCTCCT. For nuclease di- 
gestion, nuclei were isolated from HEK293 cells essentially as described 
(25). The nuclei were resuspended at 10,000/^xl in digestion buffer. This 
suspension was aliquoted (100 id) into tubes and digested with nuclease 



Table II 

The equilibrium dissociation constants of zinc finger proteins 



Name Target sequence K d 







nM 


EPOZFP-862a 


GCGGTGGCTc 


6 


EPOZFP-862b 


GCGGTGGCTc 


9 


EPOZFP-862c 


GCGGTGGCTc 


2 


EPOZFP-545a 


GGTGAGGAGt 


7.5 


EPOZFP-545b 


GGTGAGGAGt 


7 


EPOZFP-535 


GAGGTGGGGa 


0.23 


EPOZFP-233 


TGGGTCGCGg 


12 


EPOZFP-82 


GTGGGGGAGa 


12 


EPOZFP-72 


GCGGGTGGGg 


22.5 


Spl 


GGGGCGGGG 


40 


Zif268 


GCGTGGGCGt 


2 



for 5 min at 22 °C. DNase I or micrococcal nuclease (Worthington) was 
added at 0, 5, 10, 20, or 40 units/ml for nuclei or 100-fold lower concen- 
trations for naked DNA. The reaction was stopped by adding Qiagen 
buffer AL, followed by proteinase K treatment and purification (DNAe- 
asy kit, Qiagen). The recovered DNA was digested to completion over- 
night at 37 °C with the indicated restriction endonuclease and then 
concentrated by ethanol precipitation. The entire sample is loaded onto 
an agarose gel of 1-2%, before electrophoresis and transfer to a Nytran 
membrane (Schleicher & Schuell). Indirect end labeling was used to 
detect the sites of nuclease cleavage. Hybridization was carried out in 
Rapid Hyb buffer (Amersham Pharmacia Biotech) using PCR-amplified 
genomic DNA fragments as indicated. The DNA probes were radiola- 
beled with [a- 32 P]dCTP to a specific activity of 10 s cpm/ng DNA. Hy- 
bridization and subsequent washings were performed at 65 °C. Mem- 
branes were visualized in the Phosphorlmager. 

RESULTS 

Characterization of Synthetic ZFPs — We made use of exist- 
ing information concerning the recognition of specific se- 
quences by zinc finger domains to design 10 proteins that we 
predicted would recognize sequences in the 5' flank of the 
human EPO gene. The strategy for assembly is illustrated in 
Fig. 1, and the details of design are summarized in Table I. The 
details of design and selection of zinc fingers capable of recog- 
nizing particular DNA sequences will be described in detail 
elsewhere. 2 The positions of the sequences within the human 
EPO gene that are targeted by our ZFP designs are illustrated 
in Fig. 2A. The ZFPs were designed to bind at four sites; site 1 
and site 2 flank an AIu element. Alu elements are known to 
position nucleosomes (28, 29); we chose these flanking sites 
because we wished to avoid sites within the presumed position 
of stable histone-DNA interactions. Sites 3 and 4 were located 
-72 to -300 bp from the transcription start site (+1). We 
wished to introduce ZFPs here because they would be adjacent 
to a known region (-61 to -45 bp) important in the regulation 
of the human EPO gene (30). We expressed these diverse zinc 
finger proteins in recombinant form and purified them (Fig. 
2B). In the nomenclature that we use to describe these pro- 
teins, each of which interacts with 9 bp of DNA, the number 
after the prefix EPOZFP delineates the position of the first 
nucleotide of the recognition sequence relative to the transcrip- 
tion start site (+ 1). Thus, EPOZFP-862 describes a zinc finger 
protein that binds to a 9-bp sequence that is positioned -862 
bp to the 5' of the transcription start site. Where there are 
multiple designs to particular sequence, this is indicated by, 
e.g., -862a, -862b, and -862c (see Table I). The recombinant 
proteins were used in gel retardation assays to quantitate their 
binding afrinities for naked DNA (Fig. 2C). These results are 
tabulated in Table II. Our results indicate that we have de- 
signed a range of zinc finger DNA binding domains that exhibit 
a range of affinities for particular DNA sequences in the 5' 
flank of the EPO gene. These affinities range from dissociation 
constants of 0.23 nM (EPOZFP-535) to greater than 20 nM 
(EPOZFP-72). We next wished to relate these proteins and 
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Fig. 3. The transcriptional activa- 
tion properties of the EPOZFPs. A, 

luciferase reporter assays involved the co- 
transfection of the ZFP expression con- 
struct (50 ng) together with the respective 
luciferase reporter DNA (900 ng) (see "Ex- 
perimental Procedures") and CMV 0-gal 
as an internal control into HEK293 cells. 
After 40 h, cell lysates were harvested 
and the luciferase activity induced by 
each EPOZFP measured as indicated. 
The pcDNANVF and -862c-NF provide 
controls using proteins lacking the zinc 
fingers and VP16 activation domain, re- 
spectively. B, ELISA assays on endoge- 
nous EPO expression. EPO protein syn- 
thesis was assayed at 40 h after 
expression of the various EPOZFPs indi- 
cated as described (see "Experimental 
Procedures"). SBS 579 contains an ex- 
pression vector for a nonspecific ZFP. C, 
quantitation of EPO mRNA by real time 
PCR (Taqman). EPO mRNA was assayed 
by real time PCR as described (see "Ex- 
perimental Procedures") after expression 
of the various EPOZFPs for 40 h as de- 
scribed, SBS 579 contain an expression 
vector for a nonspecific ZFP. D f quantita- 
tion of the various EPOZFP mRNAs by 
real time PCR. The mRNA synthesized 
from the various transiently transfected 
EPOZFP expression constructs and their 
controls is quantitated. E, immunoblot of 
protein synthesized from the various 
transfected EPOZFPs and their controls. 
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Fig. 4. Inducible expression of the endogenous EPO gene in response to synthesis of EPOZFP-862. A, EPO expression and Dox 
dose-response curve for stably transformed 293 cells containing copies of the EPOZFP-862c gene under control of a Tet-responsive full-length 
CMV promoter (see "Experimental Procedures"). Conditioned medium was harvested 48 h after the addition of Dox at the concentrations indicated 
and analyzed by an EPO ELISA kit (see "Experimental Procedures"). B, EPOZFP-862 immunoblot. Protein extracts from cells treated with the 
indicated Dox concentrations were analyzed by immunoblotting with the anti-Flag antibody. Extracts were prepared from cells 48 h after 
induction. C, Northern blot analysis of EPO mRNA induced by EPOZFP-862. EPO mRNA signals were shown for EPOZFP-862c-transfected 293 
cells and for Dox-induced EPOZFP-862 cells (lanes 2 and 4 y respectively). As a control untransfected (lane 1) and Dox-minus cells (lane 3) are also 
used to extract mRNA. The EPO probed membrane was stripped and rehybridized with a 32 P-labeled riboprobe containing antisense fragments 
that hybridize to ZFP mRNA as well as to the human /3-actin gene that served as as a loading control. 



their recognition of these diverse sequences to their capacity to 
activate transcription from the EPO promoter when fused to a 
luciferase reporter. 

Transcriptional Regulation of the EPO Promoter Using 
ZFPs — Intensive analysis has defined the key cis-acting ele- 
ments for regulation of the human EPO promoter. Erythropoi- 
etin is a protein hormone produced primarily in the kidneys 
and liver which controls the biogenesis of erythrocytes. Expres- 
sion of the EPO gene is primarily controlled in a tissue-specific 
manner that is sensitive to oxygen tension. Hypoxia induces 
EPO gene expression at the transcriptional level through the 
action of a hypoxia response element, which interacts with 



HIF-1, a heterodimer of the PAS-family transcription factor 
HIF-la and ARNT, the aryl hydrocarbon receptor nuclear 
translocator (31). The hypoxia response element is located at 
the 3' flank of the EPO gene (31-33). A homodimer of HNF-4 
also binds at this site (32, 33). The 5' flank of the EPO gene has 
also been determined to influence transcription; however, the 
only known regulatory elements that contribute to EPO gene 
activity within this DNA segment are located proximal to the 
start site of transcription (between -61 and -45 relative to the 
start at +1) (30). All ZFP transcription factors activate the 
luciferase reporter gene (Fig. 3A). These constructs contain 
three reiterations of the ZFP recognition site upstream of the 
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mutant versions of this sequence to which EPOZFP-862 binds with varying affinity were inserted into the luciferase reporter construct and the 
capacity of EPOZFP-862 protein to activate these reporters quantitated relative to CMV-driven j3-gal as a control. B, luciferase assays versus 
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24 h of induction with Dox (0.05 /Ag/ml), the activities of the luciferase and internal control j3-gal were measured. The normalized luciferase 
activities were graphed against the corresponding K d values, which are represented on a log scale. Standard deviations are shown. 



SV40 minimal promoter. Expression of a ZFP that lacks the 
VP 16 activation domain does not activate luciferase (Fig. 3A, 
NVF). The range of activation varies between 2-fold 
(EPOZFP-233) and 16-fold (EPOZFP-862c). Surprisingly, in 
the data shown in Fig. 3A and in all subsequent repetitions of 
this experiment, there is no strong correlation in these tran- 
sient assays between the capacity of a ZFP to activate tran- 
scription and the measured binding affinity (Table II). Both the 
strong (EPOZFP-535), K d = 0.23 nM) and weak (EPOZFP-72, 
K d — 22.5 nM) binders activate transcription with comparable 
efficiency. Although these results indicate that our basic design 
of ZFP transcription factors is robust, there are several vari- 
ables to interpretation of these data. The newly synthesized 
ZFPs might have varying stability, or be deficient in entry into 
the nucleus, or might be engaged in non-productive protein- 
protein interactions with inhibitory factors. We have controlled 
for these possibilities in various ways. Western blotting dem- 
onstrates that all of the ZFPs accumulate to high levels in cells 
(see Fig. 3E) and the cells accumulate comparable levels of ZFP 
encoding mRNA (see Fig. 3D). In other experiments, we have 
established that the ZFPs accumulate uniformly in nuclei and 
that they retain identical capacities within nuclear extracts 
prepared from these cells to recognize that DNA sequences to 
which they bind in vitro (data not shown). 

In order to further test the relationship between DNA bind- 
ing affinity and the capacity of the ZFPs to activate the EPO 
gene by virtue of their binding at a particular site, we estab- 
lished an inducible cell line expressing the most effective acti- 
vating ZFP in our transient expression assays, EPOZFP-862. 



Our strategy was to transfect pTO-EPOZFP-862c into the 
T-Rex -292™ cell line (see "Experimental Procedures"), which 
stably expresses the Tet repressor and allows for the regulated 
expression of a gene of interest under the control of a Tet- 
responsive promoter when doxycycline is added to the culture 
medium. Twenty-one stable clones were isolated and analyzed 
for ZFP expression and erythropoietin expression in a Dox-de- 
pendent manner (Fig. 4, A and B). We used Northern blotting 
to confirm that the endogenous human EPO chromosomal locus 
was activated to synthesize a full-length poly(A) + mRNA in 
response to either transient or stable expression of 
EPOZFP-862 (Fig. 4C). 

We next investigated the activation of a luciferase reporter to 
which was fused at the same site relative to the SV40 minimal 
promoter three reiterations of the EPOZFP-862 recognition 
sequence and mutant forms of this sequence to which 
EPOZFP-862 has reduced binding affinity (Fig. 5A). Under 
these defined conditions, there is a strong correlation between 
the ZFP affinity for a particular sequence and the capacity to 
activate transcription (Fig. 5B). We suggest that, within one 
defined chromosomal or chromatin context, a particular ZFP 
will regulate transcription in a manner that largely reflects 
relative binding affinity for the cognate sequence. Transcrip- 
tional activation occurs most efficiently when the K d is progres- 
sively reduced from 10 nM and is relatively inefficient when the 
K d exceeds 30 nM (Fig. 5B\ This contrasts with the lack of 
correlation between binding affinity and transcriptional re- 
sponse apparent when different ZFP binding sites relative to 
the transcription start site are used in luciferase reporter as- 
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+ 791 Xbal site was used as a probe. Relative positions of DNase I-hypersensitive sites (+4591 and +4091) plus the position of the hypoxic response 
element are indicated. 



says and in activation experiments on the endogenous chromo- 
somal locus (Table II, Fig. 3, A~C). 

We next compared the capacity of ZFPs that would recognize 
sequences present in the 5' flank of the EPO gene in vitro on 
naked DNA and in vivo on transiently transfected DNA, to 
activate the endogenous chromosomal locus. In Fig. 3 (A-C), we 
compare the capacity of ZFPs to function in the luciferase 
reporter assay with their capacity to activate the endogenous 
EPO gene as assayed both by quantitative real time PCR 
(Taqman) and by ELISA assays. We find that, in contrast to the 
pleiotropic activation of the transiently transfected reporter 
construct (Fig. 3A), activation of the endogenous chromosomal 
EPO promoter is found with those ZFPs targeted to one se- 
lected target in the promoter (Fig. 3, B and C). This is the 
binding site for EPOZFP-862. Three individual protein de- 
signs with varying affinity for DNA (EPOZFP-862a, -b, and -c) 
activate the endogenous chromosomal EPO gene at this site. 
All other ZFPs targeted elsewhere in the 5 '-flanking sequences 
of the EPO gene are reduced in their capacity to activate 
transcription (Fig. 3C). These results indicate that there exist 
constraints on ZFP function on the endogenous chromosomal 
EPO gene. 

Analysis of ZFP Access to the Chromatin Infrastructure — 
There are no documented regulatory elements in the 5' flank of 
the EPO gene beyond the immediately proximal promoter re- 
gion between -61 and -45 (30). All other defined regulatory 
elements lie 3'. Thus, the capacity of three ZFPs targeted to the 
site 1 region (Fig. 2A) at -862 would appear to generate a novel 
cis-acting element de novo. In order to further test this idea, we 



performed a DNase I hypersensitivity analysis of the promoter 
region. We did not detect preferred sites of cleavage in the 
vicinity of -862, nor were any sites detectable within 3 kilo- 
base pairs between -460 and -3673 5' of the EPO transcrip- 
tion unit (Fig. 6, A and B). This lack of DNase I hypersensitivity 
might be anticipated because the promoter is normally silent in 
293 cells (30-33). In contrast to the lack of DNase I hypersen- 
sitivity in the immediate vicinity of the quiescent EPO pro- 
moter, robust sites of DNase I hypersensitivity are found at the 
3' flank of the previously documented hypoxia response ele- 
ment (Fig. 6, A and C). This result leads to the observation that 
the EPOZFP-862a, -b, and -c proteins are not gaining access 
through a preexisting DNase I-hypersensitive site in 
chromatin. 

Our evidence for the EPOZFP-862a, -b, and -c proteins 
functioning at Site 1 (Figs. 2 and 3) led us to employ a more 
direct strategy to test our hypothesis that these ZFPs were 
gaining access to the chromatin infrastructure at this site. 
Chromatin immunoprecipitation is a powerful method to dem- 
onstrate selective enrichment of a particular regulatory protein 
at a particular site in chromatin (34). We chose to test for 
enrichment of the EPOZFP-862 protein both at site 1 and at a 
second site of perfect identity located at +6826 relative to the 
transcription start site. After cross-linking and immunoprecipi- 
tation using antibodies against VP16 (see "Experimental Pro- 
cedures"), we made use of three primer pairs centered on 5'- 
flanking regions -1838, -927, -363 and two sets of primer 
pairs centered on sequences downstream from the start site of 
transcription at +538 and the other site of perfect identity 
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an antibody against the VP16 domain or EPOZFP-862c (see "Experimental Procedures"). Five primer pairs were used to scan the distribution of 
EPOZFP-862c along the EPO locus. A control nonspecific antibody was also used for immunoprecipitation. A primer pair contained within the 
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by deproteinization and subsequent digestion with PstL Nucleosomal ladders were visualized by indirect end labeling with a Pstl-Hincll fragment 
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+6839. The abundance of these DNA sequences was quanti- 
tated relative to a GAPDH internal control. Our results show a 
30-70-fold enrichment between the -927 and -363 sites for 
the EPOZFP-862 protein (Fig. 7A). This distribution might be 
expected due to our use of DNA fragments of 500 bp in size in 
our chromatin immunoprecipitation assays. Both the -927 and 
-363 fragments would contain the -862 site within this size 
distribution. An additional important control is the loss of 
recovery of DNA fragments at both the 5' (-1838) and 3' 
(+538) regions flanking the -862 site (Fig. 7A). Finally, it is 
noteworthy that an identical sequence, GCGCTGGCT, for rec- 
ognition by EPOZFP-862 positioned at +6839 is not recovered 
at all in this experiment. (Fig. 7A). We conclude that parame- 
ters other than DNA sequence per se determine the access of 
the EPOZFP-862 protein to the cognate sequence. 

In order to better understand the access of EPOZFP-862 
protein to the endogenous chromosomal EPO locus, we made 
use of micrococcal nuclease digestion of chromatin. This en- 
zyme is considerably smaller than DNase I and gains efficient 
access to the linker DNA between nucleosomes (35). Sequence 
analysis of the 5' flank of the EPO gene indicates the presence 
of Alu elements, which are known to position nucleosomes (28, 
29). The EPOZFP-862 site is at the 5' of the Alu element (Fig. 
7A), which we would predict would lie in linker sequence adja- 



cent to the nucleosome core. To test this hypothesis, we di- 
gested chromatin with micrococcal nuclease and then used 
indirect end labeling to determine the positions of nucleosomes. 
We find that nucleosomes are positioned such that linker DNA 
is accessible at -490 and -670 with a major region of micro- 
coccal nuclease sensitivity beginning at -852. Thus, a domain 
of micrococcal nuclease sensitivity corresponds to a site of 
responsiveness to EPOZFP-862. 

As a test of the capacity of the EPOZFP-862 protein to 
initiate events leading to chromatin remodeling, we examined 
the chromatin structure of the endogenous EPO gene in the cell 
line that induced expression of EPOZFP-862 in response to 
doxycycline. In the absence of EPOZFP-862 expression, there 
is no significant expression of the endogenous EPO gene (Figs. 
3 and 4) and there is no DNase I hypersensitivity over the 
promoter (Fig. 6, A and B). In the presence of doxycycline, the 
endogenous EPO locus is transcriptionally activated (Figs. 3 
and 4) and a DNase I-hypersensitive site appears over the 
promoter (Fig. 8, A and B> lanes 1 and 2). In the absence of 
doxycycline, no DNase I-hypersensitive site appears (Fig. 8, A 
and B, lanes 3 and 4). These results confirm the capacity of the 
synthetic transcriptional regulators to remodel chromatin in a 
targeted manner. 
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DISCUSSION 

The primary conclusion from these experiments is that the 
activity of synthetic zinc finger transcription factors at an 
endogenous chromosomal locus (Figs. 3 {B and C) and 4) differs 
significantly from the determinants of protein binding to naked 
DNA (Fig. 2 and Table II) and from the capacity of these 
diverse proteins to activate transiently transfected templates 
(Figs. 3A and 5). Although there is little difficulty in activating 
transcription from transiently transfected templates (Fig. 3A), 
there are constraints in gaining access to sites within endoge- 
nous chromosomes in order to regulate expression (Figs. 3 (B 
and C) and 6-8). These observations are consistent with ear- 
lier work on the mouse mammary tumor virus promoter in 
which the access of the transcriptional regulators NF1 and Oct 
1 is dependent on the nature of the chromatin infrastructure 
(36-38). The prior assembly of nucleosomes on the NF1 and 
Oct 1 binding sites precludes their access to the MMTV pro- 
moter in chromatin (36, 37). Chromatin remodeling events 
dependent on the glucocorticoid receptor are necessary to allow 
NF1 and Oct 1 to bind and function in transcriptional activa- 
tion (38). Our observations that a ZFP binding site lies in an 
accessible linker region between nucleosomes and that this 
facilitates stable association with chromatin is also consistent 
with earlier work on the natural zinc finger protein TFIIIA 
(14-26). 

Earlier experiments have demonstrated the capacity of de- 
signer zinc finger proteins to regulate transcription on tran- 
siently transfected templates (9-13). These templates are in- 
efficiently assembled into chromatin if at all (39). For the 
MMTV promoter, the chromatin assembled on transiently 
transfected templates poses no measurable impediment to the 
association of NF1 and Oct 1 (36). In contrast, chromatin as- 
sembled on replicating episomes or within endogenous chromo- 
somes poses a much severe impediment to transcription factor 
binding (36). In two instances, successful regulation of tran- 
scription by ZFP transcription factors within a chromosomal 
locus has been described (9, 13). These were the regulation of 



the fusion oncogene for BCR-ABL by the DNA binding domain 
of a three finger zinc finger protein lacking any other regula- 
tory domains (9), and the regulation of the erbB-2 locus by 
ZFPs to which were fused both activation and repression do- 
mains (13). The ZFP that functioned on the BCR-ABL oncogene 
has a very weak binding affinity for DNA, yet this protein was 
targeted to a randomly integrated cDNA construct represent- 
ing the chromosomal breakpoint where the two genes were 
fused (9). These two structural features may very well repre- 
sent particularly open regions of chromosomal structure. The 
ZFPs that regulate the erbB-2 chromosomal locus have much 
higher affinities for DNA (K d = <1 nM) and are targeted to 
5 '-untranslated region of the erbB-2 oncogene (13). There is no 
reason to anticipate that such a region of chromatin would be 
more accessible to regulatory factors than other regions, and in 
fact we have tested several ZFPs that recognize the 5 '-untrans- 
lated region of the EPO gene that do not activate transcription 
in our system. We predict that the ZFPs that regulate the 
endogenous erbB-2 locus do so by binding to an accessible 
linker DNA in chromatin. Future experiments will test this 
possibility. 

We demonstrate that the EPOZFP-862 is highly enriched in 
chromatin in the vicinity of the designated recognition site in 
chromatin and that other proteins that selectively bind this site 
show comparable transcriptional regulatory effects (Figs. 3 and 
7). There are no known transcriptional regulatory elements 
that normally contribute to EPO gene expression at this site 
(30-34). In fact, this region of the EPO gene is refractory to 
DNase I cleavage consistent with the lack of regulatory DNA 
(Figs. 6 and 8). Thus the targeting of EPOZFP-862 to a linker 
region flanking a nucleosome positioned on an Alu element 
generates a novel cis-acting element (Figs. 3 and 7). 

These observations suggest an improved strategy for dissect- 
ing the requirements for eukaryotic gene control. First, it is 
important to determine the underlying chromatin infrastruc- 
ture and identify linker DNA segments between positioned 
nucleosomes. Second, it should be possible to design ZFPs that 
will recognize these accessible sequences such that they bind 
stably in chromatin (Figs. 7 and 8). Third, it would then be 
possible to systematically recruit a variety of transcriptional 
activation or repression domains to that particular site in order 
to remodel chromatin and regulate transcription. In this way, it 
should be possible to manipulate human genes in their endog- 
enous chromosomal context and more effectively design ZFPs 
with therapeutic potential. 
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